Supplementary Material for “DivMCuts: Faster Training of Structural SVMs with Diverse M-Best Cutting-Planes”
نویسندگان
چکیده
This section reviews the notation used in the main paper and revisits cutting-plane methods for training structured-output predictors. Notation. For any positive integer n we use [n] as shorthand for the set {1, 2, . . . , n}. We use y for a structured-output, and Y = (y1, . . . ,y|Y|) for a tuple of structured-outputs. Given a training dataset of input-output pairs {(xi,yi) |xi ∈X ,yi ∈Y}, we are interested in learning a mapping f : X → Y from an input space X to a structured output space Y that is finite but typically exponentially large (e.g., the set of all segmentations of an image, or all English translations of a Chinese sentence). Structural Support Vector Machines (SSVMs). In an SSVM setting, the mapping is defined as f(x) = argmaxy∈Y w TΨ(x,y), where Ψ(x,y) is a joint feature map: Ψ : X × Y → Rd. The quality of the prediction ŷi = f(xi) is measured by a task-specific loss function ` : Y × Y → R+, where `(yi, ŷi) denotes the cost of predicting ŷi when the correct label is yi. Since the task-loss ` is typically non-convex and non-continuous, [2] proposed to optimize the hinge upper bound on `. The regularized hinge-loss SSVM learning problem can be formulated as a QP with exponentially many constraints. In this paper, we work with the 1-slack formulation of Joachims et al. [1] in the MarginRescaling variant (1). Optimization Problem 1 (OP1). 1-slack Structural SVM (Margin-Rescaling) Training (Primal) formulation,
منابع مشابه
DivMCuts: Faster Training of Structural SVMs with Diverse M-Best Cutting-Planes
Training of Structural SVMs involves solving a large Quadratic Program (QP). One popular method for solving this QP is a cutting-plane approach, where the most violated constraint is iteratively added to a working-set of constraints. Unfortunately, training models with a large number of parameters remains a time consuming process. This paper shows that significant computational savings can be a...
متن کاملFaster Training of Structural SVMs with Diverse M-Best Cutting-Planes
Training of Structural SVMs involves solving a large Quadratic Program (QP). One popular method for solving this optimization problem is a cutting-plane approach, where the most violated constraint is iteratively added to a working-set of constraints. Unfortunately, training models with a large number of parameters remains a time consuming process. This paper shows that significant computationa...
متن کاملA Sequential Dual Method for Structural SVMs
In many real world prediction problems the output is a structured object like a sequence or a tree or a graph. Such problems range from natural language processing to computational biology or computer vision and have been tackled using algorithms, referred to as structured output learning algorithms. We consider the problem of structured classification. In the last few years, large margin class...
متن کاملA comparative study of performance of K-nearest neighbors and support vector machines for classification of groundwater
The aim of this work is to examine the feasibilities of the support vector machines (SVMs) and K-nearest neighbor (K-NN) classifier methods for the classification of an aquifer in the Khuzestan Province, Iran. For this purpose, 17 groundwater quality variables including EC, TDS, turbidity, pH, total hardness, Ca, Mg, total alkalinity, sulfate, nitrate, nitrite, fluoride, phosphate, Fe, Mn, Cu, ...
متن کاملTraining SVMs Without Offset
We develop, analyze, and test a training algorithm for support vector machine classifiers without offset. Key features of this algorithm are a new, statistically motivated stopping criterion, new warm start options, and a set of inexpensive working set selection strategies that significantly reduce the number of iterations. For these working set strategies, we establish convergence rates that, ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013